Studies in History and Philosophy of 
Modern Physics 36, 716-723 (2005)



Book Review 



Stephen L. Adler, Quantum theory as an emergent phenomenon, Cambridge University Press,
Cambridge, ISBN 0521831946, 2004, 238pp.



Reviewed by Philip Pearle, Hamilton College, Clinton NY 13323 (ppearle@hamilton.edu).

The enlightenment task of trying to explain the unnatural by the natural (in this case, the
"unnatural" being quantum physics and the "natural" being classical physics) was begun soon after
the codification of quantum theory, in 1926, by Louis de Broglie and Erwin Madelung. These, and
later approaches by David Bohm, Edward Nelson and others, could be regarded as half-measured.
Classical particles and their dynamics are re-introduced, but a strong element of the unnatural
remains. In the de Broglie-Bohm and Madelung models, it is the mysterious quantum force. In
the Nelson model, it is the mysterious backward diffusion process (which, together with the usual
classical forward diffusion process, forces a particle's drift, its mean position, to be a dynamically
determined quantity instead of, as classically, an independent variable set by external influences).

Stephen Adler's extraordinary work is full-measured. The goal is to obtain relativistic quantum 
field theory (and, incidentally, thereby, its slow-speed limit, non-relativistic quantum mechanics) as 
a statistical mechanics canonical ensemble average of classical variables obeying classical dynamics. 
The argument is laid out very carefully, with honest and extensive discussion of requirements and 
assumptions and great attention to detail. Reading the book, I was reminded of a detective-adventure
story, where the author initially, comfortingly, assures one of the safe outcome, but one
has strong interest in the twists and turns of the story, and whether the author can truly resolve 
the dilemmas to one's satisfaction. In usual reviews of detective-adventure stories it is forbidden 
to give away the plot, but for this review I believe it is mandatory. Although that has aspects of 
the worst kind of "book report," I think it is the only way I can fulfill my obligation to the reader 
and convey the richness and some of the subtlety of this unique contribution. 

The basic classical dynamical variables are $N \times N$ matrices $q_r$, $p_r$. Those with complex elements are
called bosonic variables, while those called fermionic variables have elements which are Grassmann
numbers (complex numbers multiplying anticommuting objects). The physical assignment of
these matrix variables is not made until much later so that the argument is always as general and
unrestricted as it can be. However, the naming of these variables prefigures their final identification,
i.e., a $q_r$ will eventually be chosen to represent a field amplitude (more precisely, $N^2$ amplitudes)
in a small volume surrounding a spatial point labeled by $r$. The choices are made that the bosonic
variables are self-adjoint and, for fermionic variables, $p_r = q_r^\dagger$.



All important quantities of classical mechanics appear here. The action, Lagrangian, Hamiltonian,
and generators of canonical transformations are formed from the trace of products of the matrix
variables and their sum with constant (not matrix) coefficients, which is why the author calls this
"Trace Dynamics." The derivative of a trace quantity $\mathbf{A} = \mathrm{Tr}\,A$ with respect to a matrix
variable is defined, e.g., $\delta\mathbf{A}/\delta q_r$ is a matrix, an element of which is the derivative
of $\mathbf{A}$ with respect to $q_r$'s transposed matrix element. This allows definition of a (trace) Poisson bracket,
$\{\mathbf{A},\mathbf{B}\} = \mathrm{Tr}\sum_r \epsilon_r[(\delta\mathbf{A}/\delta q_r)(\delta\mathbf{B}/\delta p_r) - (\delta\mathbf{B}/\delta q_r)(\delta\mathbf{A}/\delta p_r)]$
($\epsilon_r = 1$ or $-1$, depending upon whether the $r$-labeled
variables are bosonic or fermionic), obeying the Jacobi identity. For an example of its usage, it
is shown that $d\mathbf{A}/dt = \partial\mathbf{A}/\partial t + \{\mathbf{A},\mathbf{H}\}$: if one wants to extract from this the dynamical equation
$dp_r/dt = -\delta\mathbf{H}/\delta q_r$, one must apply it to $\mathbf{A} = \mathrm{Tr}\,p_r j$ (where $j$ is an arbitrary constant matrix) rather
than to $\mathbf{A} = \mathrm{Tr}\,p_r$, which can only give the equation of motion for the diagonal elements of $p_r$.
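
The definition of the derivative of a trace quantity can be checked numerically. The following is a minimal sketch, assuming bosonic (ordinary complex) matrices and an illustrative trace quantity $\mathrm{Tr}(q p q p)$ chosen for this note rather than taken from the book. It compares the matrix obtained by cyclically permuting inside the trace with a finite-difference derivative with respect to the transposed matrix element.

```python
import numpy as np

rng = np.random.default_rng(0)
N = 3

def random_hermitian(n):
    a = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (a + a.conj().T) / 2

q = random_hermitian(N)
p = random_hermitian(N)

def trace_A(q, p):
    # Illustrative trace quantity A = Tr(q p q p); an assumption of this sketch.
    return np.trace(q @ p @ q @ p)

def dA_dq(q, p):
    # Vary each q in turn and cycle it to the right inside the trace,
    # so that delta A = Tr[(dA/dq) delta q]. Both q's give p q p.
    return 2 * (p @ q @ p)

def numerical_dA_dq(q, p, h=1e-6):
    # Element (i, j) of dA/dq is the derivative of A with respect to q_ji,
    # i.e. the transposed matrix element.
    grad = np.zeros((N, N), dtype=complex)
    for i in range(N):
        for j in range(N):
            dq = np.zeros((N, N), dtype=complex)
            dq[j, i] = h
            grad[i, j] = (trace_A(q + dq, p) - trace_A(q - dq, p)) / (2 * h)
    return grad

max_error = np.abs(numerical_dA_dq(q, p) - dA_dq(q, p)).max()
print(max_error)
```

The printed difference should be at the level of rounding error, since the central difference is exact for a quantity quadratic in $q$.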

There are three conserved quantities of special importance. The first is the trace Hamiltonian $\mathbf{H}$
itself (with $H$ assumed to be time-independent, self-adjoint, and formed with as many fermionic
$q$'s as $p$'s in its matrix products). The Poisson bracket formalism is invariant under canonical
transformations, and $\mathbf{H}$ is the infinitesimal canonical generator of time translations. The second,
$\mathbf{N} = i\,\mathrm{Tr}\sum_{r \in F} q_r p_r$ (the sum is limited to fermionic variables), is evocatively called the "trace fermion
number," but, of course, these are classical fields which do not describe particles. $\mathbf{N}$ is the
infinitesimal canonical generator of phase transformations of the fermionic variables, under which the
Lagrangian is invariant.

When a theory provides a "click," something neat which drops out of the mathematics, one's
attention perks up. Such is the third conserved quantity (or, rather, $N^2$ conserved quantities)
uncovered by Adler's student Andrew Millard,

$$\tilde{C} \equiv \sum_{r \in B} [q_r, p_r] - \sum_{r \in F} \{q_r, p_r\}$$

(the sums run over the bosonic and fermionic variables respectively; $[\,,]$ and $\{\,,\}$ denote commutator
and anticommutator). $\tilde{C}$ is traceless and anti-self-adjoint. That
a commutator should appear in a fundamental way in this classical theory is surprising, since the
matrix dynamical variables can take on any values and commutation plays no role. The reason
behind these conserved quantities is that the Lagrangian, like any trace quantity, is invariant under
unitary transformations of the $q$'s and $p$'s (these are special cases of canonical transformations). The
$N^2$ conserved quantities making up $\tilde{C}$ arise from Noether's theorem applied to the $N^2$ independent
unitary transformations. It is a "thermal" average of $\tilde{C}$ which will morph into $i\hbar$.
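
The conservation of the commutator sum can be seen in a small numerical experiment. The sketch below is only an illustration under stated assumptions: two bosonic matrix pairs, a trace Hamiltonian $\mathrm{Tr}[(p_1^2 + p_2^2 + q_1^2 + q_2^2)/2 + g\,q_1^2 q_2^2]$ invented for this note, and a standard fourth-order Runge-Kutta integrator. None of these choices come from the book.

```python
import numpy as np

rng = np.random.default_rng(1)
N, g = 3, 0.1

def random_hermitian(n):
    a = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    return (a + a.conj().T) / 2

def velocity(state):
    # Hamilton's equations for the assumed bosonic trace Hamiltonian
    # H = Tr[(p1^2 + p2^2 + q1^2 + q2^2)/2 + g q1^2 q2^2].
    q1, q2, p1, p2 = state
    f1 = q1 + g * (q1 @ q2 @ q2 + q2 @ q2 @ q1)   # dH/dq1
    f2 = q2 + g * (q2 @ q1 @ q1 + q1 @ q1 @ q2)   # dH/dq2
    return np.array([p1, p2, -f1, -f2])           # (dq/dt, dp/dt)

def rk4_step(state, dt):
    k1 = velocity(state)
    k2 = velocity(state + 0.5 * dt * k1)
    k3 = velocity(state + 0.5 * dt * k2)
    k4 = velocity(state + dt * k3)
    return state + dt * (k1 + 2 * k2 + 2 * k3 + k4) / 6

def commutator_sum(state):
    # Bosonic part of the conserved matrix: sum over r of [q_r, p_r].
    q1, q2, p1, p2 = state
    return (q1 @ p1 - p1 @ q1) + (q2 @ p2 - p2 @ q2)

state = np.array([random_hermitian(N) for _ in range(4)])
c_start = commutator_sum(state)
for _ in range(2000):
    state = rk4_step(state, 1e-3)
c_end = commutator_sum(state)

drift = np.abs(c_end - c_start).max()
print(drift)
```

Each matrix element of the commutator sum stays fixed (up to integration error) even though the individual $q_r$ and $p_r$ evolve, and the initial value is traceless and anti-self-adjoint as stated in the text.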

For, after dynamics, comes statistical mechanics. Introduced is the phase space measure $d\mu$ (the
product of the independent real and imaginary parts of $d(q_r)_{ij}$ and $d(p_r)_{ij}$). Liouville's theorem
holds: $d\mu$ is shown to be invariant under general canonical transformations and so, in particular,
under time evolution. The dynamics of the physical system is assumed to be complicated enough so
that its microcanonical ensemble obeys the ergodic hypothesis, i.e., is uniformly spread out over the
available phase space constrained by constant $\mathbf{H}$, $\mathbf{N}$ and $\tilde{C}$. The equilibrium probability density
distribution $\rho$ of a subsystem is obtained in the usual canonical ensemble way, by maximizing
the entropy subject to the constraints of constant $\mathbf{H}$, $\mathbf{N}$ and $\tilde{C}$ expectation values. The result
is $\rho = \exp -[\mathrm{Tr}\,\tilde{\lambda}\tilde{C} + \tau\mathbf{H} + \eta\mathbf{N}]$, where the Lagrange multipliers $\tau$, $\eta$ and the traceless and
anti-self-adjoint matrix elements $\tilde{\lambda}_{ij}$ are inverse "temperatures" (e.g., like the chemical potential,
which is the inverse "temperature" associated with conservation of particle number in the usual
grand canonical ensemble).

The mean value of any polynomial in the dynamical variables, $A$, is denoted $\langle A\rangle_{AV} \equiv \int d\mu\,\rho A$. A
unitary transformation of this equation and use of the unitary invariance of $d\mu$, $\mathbf{H}$ and $\mathbf{N}$ shows
that any $\langle A\rangle_{AV}$ must be a function of $\tilde{\lambda}$, the only non-dynamical matrix in $\rho$. $\tilde{\lambda}$ can be expressed
in any basis, so choose the basis in which it is diagonal. Choose $A = \tilde{C}$: then, in this basis, $\langle\tilde{C}\rangle_{AV}$
is also diagonal.



Now, a crucial assumption is made, that these diagonal elements of $\langle\tilde{C}\rangle_{AV}$ all have the same
magnitude. (The author suggests that this democratic behavior could eventually be understood
as arising from initial conditions and/or from a deeper understanding of the dynamics.) That
magnitude will be identified with $\hbar$. Then, since $\tilde{C}$ is traceless and anti-self-adjoint, $N$ must be
even (hereby assumed) and $\langle\tilde{C}\rangle_{AV} = i_{\mathrm{eff}}\hbar$, where $i_{\mathrm{eff}}$ is a diagonal matrix with $N/2$ elements
$+i$ and $N/2$ elements $-i$. The assumption is implemented by choosing the canonical ensemble so
that $\tilde{\lambda}$ is restricted to equal $i_{\mathrm{eff}}$ multiplying a constant, the lone remaining inverse temperature
associated with conservation of $\tilde{C}$. About the magnitude of $\hbar$, surprisingly little is said, except for
the comment "our approach implies that it has a dynamical origin." Presumably that magnitude
depends upon the value of the associated inverse temperature. One might speculate that initial
conditions and dynamics could determine that temperature in the same sense that they determine
the 3° cosmic radiation temperature.

All dynamical $N \times N$ matrices are then written, with the help of the Pauli matrix algebra, in
$2 \times 2$ block form, where the upper left $N/2 \times N/2$ block has $i_{\mathrm{eff}} = i$ and will be responsible for
quantum theory. The lower right $N/2 \times N/2$ block has $i_{\mathrm{eff}} = -i$ and will be the complex conjugate
of quantum theory, a poor relative which has to come along for the ride. These diagonal blocks,
which commute with $i_{\mathrm{eff}}$, are labeled with the subscript eff (presumably for effective), while the
two off-diagonal blocks, which anticommute with $i_{\mathrm{eff}}$, are not effective for the quantum theory
consideration. Just as anticommutation of the diagonal Pauli matrix $\sigma_z$ with any $2 \times 2$ matrix removes
the off-diagonal part of that matrix, the anticommutator $\{A, i_{\mathrm{eff}}\} = 2 i_{\mathrm{eff}} A_{\mathrm{eff}}$ projects onto the
effective sector.
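
The projection property is easy to verify directly. The following sketch assumes only what the paragraph states (a diagonal matrix with $N/2$ entries $+i$ and $N/2$ entries $-i$, and an effective part made of the two diagonal blocks); the matrix size and the random test matrix are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(2)
N = 6
half = N // 2

# i_eff: diagonal with N/2 entries +i followed by N/2 entries -i.
i_eff = 1j * np.diag([1.0] * half + [-1.0] * half)

# Arbitrary complex test matrix.
A = rng.normal(size=(N, N)) + 1j * rng.normal(size=(N, N))

# Effective part: keep only the two diagonal N/2 x N/2 blocks.
A_eff = np.zeros_like(A)
A_eff[:half, :half] = A[:half, :half]
A_eff[half:, half:] = A[half:, half:]

# The anticommutator with i_eff removes the off-diagonal blocks.
anticommutator = A @ i_eff + i_eff @ A
projection_error = np.abs(anticommutator - 2 * i_eff @ A_eff).max()

# The effective part commutes with i_eff.
commutator_error = np.abs(A_eff @ i_eff - i_eff @ A_eff).max()
print(projection_error, commutator_error)
```

Both printed numbers should vanish to machine precision, which is the matrix analogue of the $2 \times 2$ Pauli matrix statement.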

The canonical ensemble probability density is still invariant under the subgroup of unitary
transformations which commute with $i_{\mathrm{eff}}$. It is because the phase space measure $d\mu$ includes integrating
over this subgroup that any $\langle A\rangle_{AV}$ is a function of $i_{\mathrm{eff}}$, which makes its upper left block just a
number. The author wishes to exhibit $\langle A\rangle_{AV}$'s matrix nature. This is accomplished by restricting
the integral range of $d\mu$ to a measure $d\hat{\mu}$, which eliminates integrating over the subgroup. This
so-called "global unitary fixing" just requires appropriately fixing the numerical values of the matrix
elements of one pair of bosonic variables (value of a bosonic field and its conjugate at one point
of space), and integrating over the rest of the measure. Although the matrix $\langle A\rangle_{\widehat{AV}}$ (the average
taken with $d\hat{\mu}$) now depends upon the fixing, $\langle\mathbf{A}\rangle_{\widehat{AV}}$ does not: the trace average is the
same as if the integral was over $d\mu$ instead of over $d\hat{\mu}$. It is later shown that what is physically
important, the "emergent" quantum theory's matrix elements, can be expressed as trace quantities. A
fixing to exhibit a specific matrix form for $\langle A\rangle_{\widehat{AV}}$ is likened to choosing a gauge in
electrodynamics to exhibit a specific functional form for the vector potential, although what is
physically important, the electromagnetic field, is independent of the choice of gauge.

The next step is to consider the ensemble average of specific quantities by utilizing an analogy
to the equipartition theorem. In standard statistical mechanics, by considering $0 = \int d\mu\,\partial(p\rho)/\partial p$
with $\rho = \exp(-\beta H)$, carrying through the derivatives results in $0 = 1 - 2\beta\langle p^2/2m\rangle_{AV}$. Here is
considered

$$0 = \int d\hat{\mu}\,\delta(\rho\mathbf{A})/\delta x_r,$$

where $x_r$ is a $q_r$ or $p_r$. (The author calls these equipartition equations "Ward identities," since
the Ward identity in e.g., quantum electrodynamics, involves similar manipulations.) The choice
$\mathbf{A} = \mathrm{Tr}\{\tilde{C}, i_{\mathrm{eff}}\}W = \mathrm{Tr}\,\tilde{C}\{i_{\mathrm{eff}}, W\}$ is made, with $W$ a polynomial in the dynamical variables. The
derivative of $\rho$ brings down three terms, involving the derivative of 1) $\mathrm{Tr}\,\tilde{\lambda}\tilde{C}$, 2) $\tau\mathbf{H}$ and 3) $\eta\mathbf{N}$,
while the derivative of $\mathbf{A}$ creates two terms, one involving the derivative of 4) $\tilde{C}$, the other of 5) $W$.
By anticommuting with $i_{\mathrm{eff}}$, term 1) drops out, and the rest of the terms then involve only effective
matrices. The canonical ensemble average of the sum of the remaining four terms vanishes, but it is
argued (by adding a term $-\sum_r \mathrm{Tr}\,j_r x_r$ to the exponent of $\rho$, and making variations in the arbitrary
functions $j_r$) that even more vanishes: the canonical ensemble average of the sum of these 4 terms
sandwiched between arbitrary polynomials in the effective dynamical variables.
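
For readers who want the standard equipartition step spelled out, here is a short worked version. It is a sketch under the usual textbook assumptions, which are mine rather than the book's: one degree of freedom, $H = p^2/2m + V(x)$, a normalized density $\rho = Z^{-1}e^{-\beta H}$, and $p\rho \to 0$ as $|p| \to \infty$ so that the integral of the total derivative vanishes.

```latex
\begin{aligned}
0 &= \int dx\,dp\,\frac{\partial (p\rho)}{\partial p}
   = \int dx\,dp\,\Big[\rho + p\,\frac{\partial \rho}{\partial p}\Big] \\
  &= 1 - \beta \int dx\,dp\,\rho\,p\,\frac{\partial H}{\partial p}
   = 1 - \beta \Big\langle \frac{p^2}{m} \Big\rangle_{AV}
   = 1 - 2\beta \Big\langle \frac{p^2}{2m} \Big\rangle_{AV} .
\end{aligned}
```

So $\langle p^2/2m\rangle_{AV} = 1/(2\beta)$. The matrix "Ward identity" follows the same pattern, with the derivative acting both on $\rho$ (bringing down terms from its exponent) and on the inserted quantity.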

Now, as they say, the plot thickens. Appropriate to a detective story, terms 2) and 3) are bumped 
off, leaving just terms 4) and 5). Also, important approximations are made in term 5). 

Term 2) is $\sim \tau\,\dot{x}_{r\,\mathrm{eff}}\,\mathrm{Tr}\,\tilde{C}_{\mathrm{eff}} i_{\mathrm{eff}} W_{\mathrm{eff}}$, where $\dot{x}_{r\,\mathrm{eff}}$ represents the effective part of the right hand side
of the dynamical equations $\dot{q}_r = \epsilon_r\,\delta\mathbf{H}/\delta p_r$ and $\dot{p}_r = -\delta\mathbf{H}/\delta q_r$. To make this vanish, it is assumed
that $\tau^{-1}$ is the Planck temperature, which is so large that $\tau\dot{x}_{r\,\mathrm{eff}}$ will be negligibly small for the
relatively slow rates of change $\dot{x}_{r\,\mathrm{eff}}$ which we see in physics accessible to us. However, when there
is such a high temperature, one expects that a variable's rate of change will be comparably fast,
not slow. So, the assumption entails more. It is supposed that the Hamiltonian is such that there
is a "mass hierarchy" which, at the top end, is governed by the Planck mass and Planck time, at
the bottom end by our accessible physics, so that $\dot{x}_{r\,\mathrm{eff}}$ is the sum of a slow part and a fast part
(like Brownian motion's superposed slow diffusion and rapid jiggles). When the ensemble average is
taken, in order to make the other part of term 2) negligible, the one involving the fast part, it is
assumed that, in those regions of phase space where it is large, the $\tilde{C}_{\mathrm{eff}}$ factor which it multiplies
is negligibly small. The plausibility of these assumptions is discussed.

The result is unusual. Usually, it is the Hamiltonian which governs the canonical distribution, but
here it is $\tilde{C}$ which dominates. Something else is achieved. One might have wondered how it is
possible to have a Lorentz invariant theory if the system is in a thermal bath, since a thermal bath
exists in a preferred frame (in which the momentum and angular momentum vanish; it is suggested
that this could be the local co-moving frame). But here, the bath and the associated Hamiltonian,
which is not a Lorentz invariant, are out of it: it is $\tilde{C}$ which reigns, and that is Lorentz invariant.
What is missing is the choice of Hamiltonian which entails this behavior. The author says, "this is
a task for the future," a not unreasonable position, given the ever-present difficulty of finding the
right Hamiltonian for particle physics.

Term 3), $\sim \eta\,x_{r\,\mathrm{eff}}\,\mathrm{Tr}\,\tilde{C}_{\mathrm{eff}} i_{\mathrm{eff}} W_{\mathrm{eff}}$, holds only for fermionic variables $x_r$. $\eta$ is chosen to vanish. It
is pointed out that, since $\rho$ does then not depend upon $\mathbf{N}$, this implies that the ensemble average
of $\mathbf{N}$ vanishes, since an exchange of the fermionic $p_r$'s and $q_r$'s in the ensemble average integral
changes the sign of $\mathbf{N}$ but leaves $d\hat{\mu}$ and $\rho$ unaffected. However, no fundamental reason for this
choice is given. It seemed to me that a reasonable argument might be to make a parallel with the
chemical potential in the usual grand canonical ensemble, which may be set equal to zero, reducing
it to the canonical ensemble, if there is no particle exchange between the system and its bath. If
the dynamical variables of the subsystem here represent fields localized at points in a volume of
space, and the bath is the fields in the surrounding volume, there is indeed no motion of variables
from one volume to the other, and $\mathbf{N}$ may be fixed for the subsystem, allowing $\eta = 0$.

Term 5) arises from $\mathrm{Tr}\,\tilde{C}_{\mathrm{eff}}\,\delta W$. It is assumed that the subsystem under consideration is so large
that fluctuations in $\tilde{C}_{\mathrm{eff}}$ are small, so that $\tilde{C}_{\mathrm{eff}}$ may be replaced by $\langle\tilde{C}_{\mathrm{eff}}\rangle_{AV} = i_{\mathrm{eff}}\hbar$. It is also
assumed that, in the ensemble averages, $W_{\mathrm{eff}}(x_r) \approx W(x_{r\,\mathrm{eff}})$ to great accuracy. I thought this
assumption not as obviously attainable as the rest because the product of an even number of $x_r$'s
has the ineffective off-diagonal blocks contributing to effective diagonal blocks. This assumption
implies that, nonetheless, the off-diagonal block contribution to the slow dynamics is negligibly
small.



The result of these assumptions is the vanishing of the canonical ensemble average of

$$i_{\mathrm{eff}}[W_{\mathrm{eff}}, x_{r\,\mathrm{eff}}] \pm \hbar\,\frac{\delta\mathbf{W}_{\mathrm{eff}}}{\delta\bar{x}_{r\,\mathrm{eff}}} \qquad (1)$$

sandwiched between arbitrary polynomials in the dynamical variables ($\bar{x}_r$ is the variable conjugate
to $x_r$, and the sign is $-$ for bosonic $x_r = q_r$ and $+$ otherwise). Eq.(1) represents a "derivation," if
you will, of Dirac's ad hoc rule of equivalence of a quantum commutator bracket and a classical
Poisson bracket.



With $W_{\mathrm{eff}} = H_{\mathrm{eff}}$, Eq.(1) gives the (sandwiched, averaged) vanishing of

$$i_{\mathrm{eff}}[H_{\mathrm{eff}}, x_{r\,\mathrm{eff}}] - \hbar\,\dot{x}_{r\,\mathrm{eff}}. \qquad (2)$$

If you were wondering how dynamics can come from averaging over a static canonical distribution,
here's how. Recall that $\dot{x}_r$ represents the function of $x$'s given by the Poisson bracket equation
of motion. That equation of motion determines the underlying dynamical solution $x_r(t)$, whose
effective part can be taken at any time. Eq.(2) says that $[x_{r\,\mathrm{eff}}(t + \Delta t) - x_{r\,\mathrm{eff}}(t)]/\Delta t$ is just what
is given by the Heisenberg equation of motion in quantum theory. Neat, huh? (Appropriate to
an adventure story, the line from the song in "Casablanca" comes to my mind, "The fundamental
things apply as time goes by.") From this, the (sandwiched, averaged) usual Heisenberg equation
of motion for any polynomial of the $x$'s follows.
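
As a worked illustration of that last step, assume that Eq.(2) has the form $i_{\mathrm{eff}}[H_{\mathrm{eff}}, x_{r\,\mathrm{eff}}] - \hbar\,\dot{x}_{r\,\mathrm{eff}}$ (a reconstruction from the surrounding definitions, so the precise form is an assumption of this note). Setting it to zero in each diagonal block gives:

```latex
\begin{aligned}
\text{upper left block } (i_{\mathrm{eff}} = i): \quad
  & \dot{x}_{r\,\mathrm{eff}} = \frac{i}{\hbar}\,[H_{\mathrm{eff}}, x_{r\,\mathrm{eff}}], \\
\text{lower right block } (i_{\mathrm{eff}} = -i): \quad
  & \dot{x}_{r\,\mathrm{eff}} = -\frac{i}{\hbar}\,[H_{\mathrm{eff}}, x_{r\,\mathrm{eff}}].
\end{aligned}
```

The first line is the usual Heisenberg equation of motion, and the second is its complex conjugate, the "poor relative" of the earlier block decomposition. To first order in $\Delta t$, the upper-block equation is equivalent to $x_{r\,\mathrm{eff}}(t + \Delta t) \approx U^\dagger x_{r\,\mathrm{eff}}(t)\,U$ with $U = \exp(-iH_{\mathrm{eff}}\Delta t/\hbar)$.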



With $W_{\mathrm{eff}} = \mathrm{constant} \times x_{s\,\mathrm{eff}}$ (the constant is a c-number for a bosonic variable, a Grassmann
number for a fermionic variable), Eq.(1) yields the (sandwiched, averaged) canonical commutation
(anticommutation) relations for bosonic (fermionic) variables.

With $W_{\mathrm{eff}}$ such that the infinitesimal generator of a canonical transformation is $\mathbf{W}$, Eq.(1) implies
that this canonical transformation for the effective variables can be implemented by a unitary
transformation with $W_{\mathrm{eff}}$ as the infinitesimal generator.

It is now just a short hop to quantum field theory. $H_{\mathrm{eff}}$ is restricted to have a nondegenerate lowest
eigenvalue with eigenvector $\psi_0$. Then, the correspondence is made that the upper left block of

$$\psi_0^\dagger\langle A(x_{\mathrm{eff}})\rangle_{\widehat{AV}}\,\psi_0 = \langle\mathrm{vac}|A(X)|\mathrm{vac}\rangle, \qquad (3)$$

where $A$ is any polynomial in $x_{r\,\mathrm{eff}}$'s, interpreted (as previously mentioned) as quantum fields. The
demonstrated properties of the left side of Eq.(3) imply that it behaves as a Wightman function,
i.e., the right side of Eq.(3) with $X$'s as quantum fields, in terms of which local relativistic quantum
field theory can be expressed.






To show that the left side of Eq.(3) can be written as an ensemble average of a trace and so, as
promised, is independent of the gauge-like "fixing," it is noted that
$\psi_0\psi_0^\dagger = (2\pi i)^{-1}\oint dz\,(z - H_{\mathrm{eff}})^{-1}$,
where the integration is over an infinitesimal contour surrounding the lowest eigenvalue of $H_{\mathrm{eff}}$.
Then, the left side of Eq.(3) is $\langle\mathrm{Tr}\,A(x_{\mathrm{eff}})\,\psi_0\psi_0^\dagger\rangle_{\widehat{AV}}$.
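
The contour-integral form of the projector can be checked numerically. The sketch below makes assumptions that are not in the review: a small random Hermitian matrix stands in for the effective Hamiltonian, the contour is a circle of finite radius (half the gap to the next eigenvalue) rather than an infinitesimal one, and the integral is approximated by an equally spaced sum.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 5

# Random Hermitian stand-in for H_eff (an assumption for illustration only).
a = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
H = (a + a.conj().T) / 2

eigenvalues, eigenvectors = np.linalg.eigh(H)   # eigenvalues in ascending order
psi0 = eigenvectors[:, 0]                       # eigenvector of the lowest eigenvalue

# Counterclockwise circle around the lowest eigenvalue, excluding the next one.
radius = 0.5 * (eigenvalues[1] - eigenvalues[0])
points = 200
theta = 2 * np.pi * np.arange(points) / points
z = eigenvalues[0] + radius * np.exp(1j * theta)
dz = 1j * radius * np.exp(1j * theta) * (2 * np.pi / points)

# (2 pi i)^(-1) times the contour integral of the resolvent (z - H)^(-1).
identity = np.eye(n)
projector = sum(np.linalg.inv(zk * identity - H) * dzk for zk, dzk in zip(z, dz))
projector = projector / (2j * np.pi)

projector_error = np.abs(projector - np.outer(psi0, psi0.conj())).max()

# An expectation value in psi0 is then a trace against the projector.
A = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
trace_error = abs(psi0.conj() @ A @ psi0 - np.trace(A @ projector))
print(projector_error, trace_error)
```

The second check is the point of the passage: once the projector is written in terms of the Hamiltonian alone, the sandwiched expectation value becomes a trace quantity.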

It is now straightforward to go from the Heisenberg picture just obtained to the Schrödinger picture
and Schrödinger's equation, utilizing the demonstrated equivalence of the canonical time translation
generated by $\mathbf{H}$, and the unitary time translation generated by $H_{\mathrm{eff}}$.

This is how the algebra of quantum theory "emerges" from the statistical mechanics of the classical 
trace dynamics. However, there is still one crucial thing missing from giving quantum theory entire: 
the Born probability interpretation. 

This is achieved by tapping into the lexicon of dynamical wavefunction collapse models first
proposed by this reviewer in the late 1970's, and well developed since. In these models, the Schrödinger
equation is altered by adding terms to the Hamiltonian, most importantly, an anti-unitary
operator multiplied by a randomly fluctuating function of white noise type (or a set of such mutually
commuting terms). This makes a superposition of states (eigenstates of the anti-unitary operator
or operators) dynamically evolve to one of the states. Which state survives depends upon the
particular white noise function. The probability associated with a particular final state is that
associated to all the white noise functions which cause it, and is equal to the Born probability (i.e.,
to the squared magnitude of the state's coefficient in the initial superposition).
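
How white noise can reproduce the Born probabilities is perhaps easiest to see in a toy two-state simulation. The sketch below is not Eq.(4) of this review or any equation from the book. It assumes a norm-preserving stochastic evolution in which the squared magnitude $x$ of the first state's coefficient obeys the driftless equation $dx = 2\sqrt{\lambda}\,x(1-x)\,dW$, and it integrates that equation with a simple Euler-Maruyama step. The rate $\lambda$, the step size and the number of trials are arbitrary illustrative values.

```python
import numpy as np

rng = np.random.default_rng(4)

def collapse_fraction(x0, trials=20000, steps=1500, dt=1e-2, rate=1.0):
    # x = squared magnitude of the first state's coefficient in each trial.
    # Assumed toy dynamics: dx = 2 sqrt(rate) x (1 - x) dW, which has no drift,
    # so the ensemble mean of x stays at its initial value x0.
    x = np.full(trials, x0)
    for _ in range(steps):
        dW = rng.normal(scale=np.sqrt(dt), size=trials)
        x = x + 2.0 * np.sqrt(rate) * x * (1.0 - x) * dW
        x = np.clip(x, 0.0, 1.0)   # guard against rare overshoots of the Euler step
    # By the end x sits near 0 or 1; count the runs that ended in the first state.
    return np.mean(x > 0.5)

fraction = collapse_fraction(0.3)
print(fraction)
```

Because the mean of $x$ is conserved while every individual run is driven to 0 or 1, the fraction of runs ending in the first state has to equal the initial squared magnitude, here 0.3 up to sampling error.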

To achieve a collapse dynamics, fluctuations of $\tilde{C}$ are taken into account. The assumption that
$\tilde{C}$ may be replaced by $\langle\tilde{C}\rangle_{AV} = i_{\mathrm{eff}}\hbar$ in term 5) of the "Ward identity" is replaced by $\tilde{C} = i_{\mathrm{eff}}\hbar$
plus a "fast" fluctuation, the latter assumed to be well approximated by white noise behavior.
However, it is necessary that this fluctuation be self-adjoint if stochastic collapse dynamics is to be
obtained. It is pointed out that the anti-self-adjointness of $\tilde{C}$ is a consequence of the choice $p_r = q_r^\dagger$
for fermionic variables. The more general choice of $p_r = q_s^\dagger A_{sr}$ (summed over $s$), with each $A_{sr}$ an
$N \times N$ matrix (and $A_{sr}^\dagger = A_{rs}$), allows $\tilde{C}$ to have a self-adjoint part. So, the substitution
$\tilde{C} = i_{\mathrm{eff}}\hbar[1 + \mathcal{K}(t) + \mathcal{N}(t)]$
is made in term 5) of the "Ward identity," with $\mathcal{K}(t)$ a c-number and $\mathcal{N}(t)$ a matrix.

This adds $\mathcal{K}(t)$- and $\mathcal{N}(t)$-dependent terms to Eqs.(1) and (2). Specializing to the upper left block
where $i_{\mathrm{eff}} = i$, the nonrelativistic limit is taken, with the fermionic $x_r$'s replaced by quantum fields
$\Psi_{r\,\mathrm{eff}}$, $\Psi^\dagger_{r\,\mathrm{eff}}$ which annihilate and create a fermion at location $r$. The modified Eqs.(2) for $\Psi_{r\,\mathrm{eff}}$
and $\Psi^\dagger_{r\,\mathrm{eff}}$, acting on the vacuum state, are taken to be the (unsandwiched, unaveraged) field theory
equations of motion, corrected for when there are fluctuations in $\tilde{C}$. The reason for the "acting
on the vacuum state" proviso is that when $i\mathcal{K}(t)$ has a real part and $i\mathcal{N}(t)$ has a self-adjoint part,
necessary if these are to be white-noise dependent terms responsible for collapse, then the equation
for $\Psi^\dagger_{r\,\mathrm{eff}}$ is not the adjoint of the equation for $\Psi_{r\,\mathrm{eff}}$.

The equation for $\Psi_{r\,\mathrm{eff}}$ acting on $|\mathrm{vac}\rangle$ vanishes identically. $\Psi^\dagger_{r\,\mathrm{eff}}|\mathrm{vac}\rangle$'s equation of motion is
converted to an equation of motion for the Schrödinger picture statevector $|\psi,t\rangle$, which is a
superposition of products of $\Psi^\dagger_{r\,\mathrm{eff}}$'s acting on $|\mathrm{vac}\rangle$. This conversion requires the assumption that $\mathcal{N}(t)$
is the sum of operators each associated with a single $r$, i.e., $\mathcal{N}(t) = \sum_r \mathcal{N}_r(t)$ ($\mathcal{N}(t)$ is specialized
so that $\mathcal{N}_r(t)|\mathrm{vac}\rangle = 0$). Neglecting the (non-collapsing) effect of real $\mathcal{K}(t)$ and self-adjoint $\mathcal{N}(t)$,
with $\mathcal{K}_1(t) = \mathrm{Im}\,\mathcal{K}(t)$ and $\mathcal{M}_1(t) = (i/2)\sum_r m_r[\mathcal{N}_r(t) - \mathcal{N}_r^\dagger(t)]$, the statevector satisfies

$$d|\psi,t\rangle/dt = \hbar^{-1}[-iH_{\mathrm{eff}} - \mathcal{K}_1(t)H_{\mathrm{eff}} - (1/2)\mathcal{M}_1(t)]\,|\psi,t\rangle. \qquad (4)$$

Eq.(4) does not yield $d\langle\psi,t|\psi,t\rangle/dt = 0$, but it is shown that this should be true for the choice
of the more general $\tilde{C}$. This discrepancy is attributed to neglect of nonlinear terms in $|\psi,t\rangle$ when
arriving at Eq.(4). With addition of such terms to give constant $\langle\psi,t|\psi,t\rangle$, and the assumption that
the resulting collapse theory's density matrix evolution equation is of the Lindblad-Kossakowski type
(so, for example, there is no superluminal signalling), the additional terms are uniquely determined.

The $\mathcal{K}_1(t)$ term in Eq.(4) yields collapse to $H_{\mathrm{eff}}$'s eigenstates. Collapse to energy eigenstates, first
proposed by Donald Bedford and Derek Wang in the mid 1970's, has since then been advocated
in formulations by Gerald Milburn, Ian Percival, Daniel Fivel and most strongly perhaps by Lane
Hughston, and has been explored by Stephen Adler and colleagues, with resulting experimental
limits on the size of this term summarized in this book. I have recently given a number of arguments
that energy-driven collapse cannot give the behavior appropriate for a collapse model capable of
describing a macroscopic world consistent with our experience. For the simplest of examples, an
isolated macroscopic object in a superposition of two locations will remain in such a superposition
since the two states have precisely the same energy spectrum (it is differences in energy spectra
that allow one state to be selected as the end product of collapse over another). Adler agrees
with this position, although the $\mathcal{K}_1(t)$ term may be there.

So, he turns to discuss the $\mathcal{M}_1(t)$ mass-density proportional term. It is not precluded from being
specialized to the form utilized by my 1989 Continuous Spontaneous Localization (CSL) collapse
model, based upon combining my previous work with aspects of GianCarlo Ghirardi, Alberto Rimini,
and Tullio Weber's 1986 Spontaneous Localization (SL) model. CSL does provide a satisfactory
description of the macroscopic world, with experimental outcomes occurring as predicted by the
Born probability rule. In this way, the theory presented here can "explain" both the structure of
quantum theory and its interpretation.

There you have it, every vicissitude overcome, every barrier gone over as well as either through or 
around, with impressive ingenuity, range and resourcefulness. Is this the long sought formulation 
which makes quantum theory understandable? I'd say a definite "maybe" or, to paraphrase Dr. 
Seuss in his moral tale of Horton, "it could be, it could be, it could be like that." The author
clearly was struck by the striking features of trace dynamics and its potential for attaining the
"emergent" quantum theory grail. The argument satisfyingly flows along, conveying that conviction,
occasionally with reservations, to the reader. The whole thing rests on the properties of an
unknown Hamiltonian. The assumptions about its behavior pile up, but they almost always appear 
natural, especially because, for the most part, they are exhaustively and evenhandedly explored so 
one is led to an appreciation of their necessity and even inevitability if the scheme is going to work. 
Lacking the right Hamiltonian, one might nevertheless hope that the future can bring pursuit of 
"toy models" to elucidate and confirm one or another feature of this remarkable construction. 

In his contribution to Paul Schilpp's collection of essays, "Albert Einstein: Philosopher-Scientist," 
Einstein wrote his essay in 1949 at the Princeton Institute for Advanced Study, repeatedly referring 
to the "statistical quantum theory." He predicts that it "would, within the framework of future 
physics, take an approximately analogous position to the statistical mechanics within the framework 
of classical mechanics." How apt it is in this year of homage to Einstein to consider that the Albert
Einstein Professor at the Princeton Institute for Advanced Study for 24 years, Stephen Adler, has
produced just such a structure.